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<210> 1 

<211> 450 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
promoter sequence 

<400> 1 



ctttttcggc 


atgagcaacc 


aacattttca 


aggtatcatc 


ctgatgcgca 


atatcggcat 


60 


cggttagcca 


taaccatttt 


acctgtccgg 


cggccttaat 


acct tgatca 


gatggttcgt 


120 


ggtgttgtta 


ccttgccgaa 


gggcaccggt 


aaaaatgttc 


gcgtcggtgt 


tttcgcccgt 


180 


ggcccgaaag 


ctgaagaagc 


taaagctgct 


ggtgcagaag 


ttgtcggcgc 


agaagacctg 


240 


atggaagcca 


ttcagggcgg 


cagcattgat 


ttcgatcgtg 


atgcccttta 


tactgaaatt 


300 


gccttgcgct 


gccataatga 


agcagcctcc 


ggtgttttgg 


cagatttaag 


cgctgcctga 


360 


ttttcgtgat 


cctctagagt 


ctatgaaatg 


gagattcatt 


tatgcctctc 


tcttattcgg 


420 


ataaccatcc 


agtcatccgc 


aagcttggcc 








450 



<210> 2 

<211> 1509 

<212> DNA 

<213> Artificial Sequence 



2 

<220> 

<223> Description of Artificial Sequence: Synthetic 
expression vector 

<400> 2 

gatcaaccgg caatttatcc acggcatcaa attcgatctg tcttttcccg tatcattggc 60 

aataccggca ttctgattac aggccgtgtt ttgaatgcgg tatgcagttt tgtctatgtc 120 

gcatggacat cccagacatt gggattgaac ctgtttggtg tcatgctttt gattacgact 180 

tttgctaccc tgatttcgga tattacccgt tttcagtcat ggcaaacctt gctgcattac 240 

ggttcaaaag cttttcagga aaaagatttt aaccaatttg atgatgtcct tgccttttgc 300 

atcagagccg atttttttag tgcggcgata ggtatgttgg tagggttagg cggtatcttg 360 

attttaggca cttcaagatt gggatggcct gccgaggtca agccagatgc cttgctttgt 420 

atgctgatta tactttttat gaatatcggc tggtccaacc gggatgttgc ggctgtgtaa 480 

ccgctttaaa ctggtcacta tttatgagtt tattacgacc tgcgtcagaa ccggaggttg 540 

tggcattggt tattggcttc atatgccttt ggggtatttt ttgtttatat ggtgcctgac 600 

gcaattcacg ctttttgtca cctgtagtta cgctggcatt tatctctttc accaatatac 660 

ggagcgagca tttccgataa gaaaaatatt tcagagaaaa acgcccgttg aagggatgtg 72 0 

gaaattcact ttaagcgtca gttttaatga aatcctagac tccattttcc agcagggtgg 780 

cacccttgct attggtagct cactgggggc tggggaagcc gctgtctatc gggtcgcgcg 840 

ccagattagt aacggtttat ccaaaccagc acagatgatg atcggctaac atgcatccac 900 

cggcagcacc ggccgtttta tgcttgggat tattgatatg ccgaaaagga tacaacatct 960 

ggaagaaaaa gacgaaggcc ggaataagcg cccattctgc aaaattgtta caacttagtc 1020 

gcgccatcag ggaatgaaaa atcaatccgt ctttttcggc atgagcaacc aacattttca 1080 

aggtatcatc ctgatgcgca atatcggcat cggttagcca taaccatttt acctgtccgg 1140 

cggccttaat accttgatca gatggttcgt ggtgttgtta ccttgccgaa gggcaccggt 1200 

aaaaatgttc gcgtcggtgt tttcgcccgt ggcccgaaag ctgaagaagc taaagctgct 1260 

ggtgcagaag ttgtcggcgc agaagacctg atggaagcca ttcagggcgg cagcattgat 1320 

ttcgatcgtg atgcccttta tactgaaatt gccttgcgct gccataatga agcagcctcc 1380 

ggtgttttgg cagatttaag cgctgcctga ttttcgtgat cctctagagt ctatgaaatg 1440 

gagattcatt tatgcctctc tcttattcgg ataaccatcc agtcatccgc aagcttggcc 1500 

gtaatccat 1509 



3 



<210> 


3 


<211> 


27 


<212> 


DNA 


<213> 


Artificial 



<220> 

<223> Description of Artificial Sequence: Synthetic primer 
<400> 3 

cgaattcctg ccgaagttta ttagcca 27 



<210> 


4 


<211> 


31 


<212> 


DNA 


<213> 


Art if icial 



<220> 

<223> Description of Artificial Sequence: Synthetic primer 
<400> 4 

aaggatcctt ccaccagcta tttgttagtg a 31 



<210> 


5 


<211> 


27 


<212> 


DNA 


<213> 


Artificial 



<220> 

<223> Description of Artificial Sequence: Synthetic primer 
<400> 5 

agaattctgc cagttggttg acgatag 27 



<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic primer 
<400> 6 

caggatcccc tcaagtcact agttaaactg 30 



<210> 7 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic primer 



<400> 7 
taatacgact 



cactataggg 



20 



<210> 


8 


<211> 


19 


<212> 


DNA 


<213> 


Artificial 



<220> 

<223> Description of Artificial Sequence: Synthetic primer 
<400> 8 

cacgacgttg taaaacgac 19 



<210> 


9 


<211> 


18 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Description of Artificial 


<400> 


9 


taacaatttc acacagga 


<210> 


10 


<211> 


30 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Description of Artificial 
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<400> 10 

gactggatgg ttatccgaat aagagagagg 3 0 



<210> 


11 


<211> 


8 


<212> 


DNA 


<213> 


Art if icial 



<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide linker 

<400> 11 
ggcgcgcc 



8 



<210> 12 

<211> 11544 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic nucleotide 
vector sequence 

<220> 
<221> CDS 

<222> (1452) . . (2735) 
<220> 

<221> modif ied_base 
<222> (3282) . . (4281) 

<223> a, c, g, t, unknown, or other 

<220> 
<221> CDS 

<222> (4916) . . (5776) 

<220> 
<221> CDS 

<222> (7061) . . (8251) 
<220> 

<221> modif ied_base 
<222> (9476) . . (11544) 

<223> a, c, g, t, unknown, or other 
<400> 12 

gatcaaccgg caatttatcc acggcatcaa attcgatctg tcttttcccg tatcattggc 60 

aataccggca ttctgattac aggccgtgtt ttgaatgcgg tatgcagttt tgtctatgtc 120 

gcatggacat cccagacatt gggattgaac ctgtttggtg tcatgctttt gattacgact 180 

tttgctaccc tgatttcgga tattacccgt tttcagtcat ggcaaacctt gctgcattac 240 

ggttcaaaag cttttcagga aaaagatttt aaccaatttg atgatgtcct tgccttttgc 300 

atcagagccg atttttttag tgcggcgata ggtatgttgg tagggttagg cggtatcttg 360 

attttaggca cttcaagatt gggatggcct gccgaggtca agccagatgc cttgctttgt 420 

atgctgatta tactttttat gaatatcggc tggtccaacc gggatgttgc ggctgtgtaa 480 

ccgctttaaa ctggtcacta tttatgagtt tattacgacc tgcgtcagaa ccggaggttg 540 

tggcattggt tattggcttc atatgccttt ggggtatttt ttgtttatat ggtgcctgac 600 

gcaattcacg ctttttgtca cctgtagtta cgctggcatt tatctctttc accaatatac 660 

99^9cgagca tttccgataa gaaaaatatt tcagagaaaa acgcccgttg aagggatgtg 720 

gaaattcact ttaagcgtca gttttaatga aatcctagac tccattttcc agcagggtgg 780 

cacccttgct attggtagct cactgggggc tggggaagcc gctgtctatc gggtcgcgcg 84 0 

ccagattagt aacggtttat ccaaaccagc acagatgatg atcggctaac atgcatccac 900 

cggcagcacc ggccgtttta tgcttgggat tattgatatg ccgaaaagga tacaacatct 960 



6 



ggaagaaaaa gacgaaggcc ggaataagcg cccattctgc aaaattgtta caacttagtc 1020 

gcgccatcag ggaatgaaaa atcaatccgt ctttttcggc atgagcaacc aacattttca 1080 

aggtatcatc ctgatgcgca atatcggcat cggttagcca taaccatttt acctgtccgg 1140 

cggccttaat accttgatca gatggttcgt ggtgttgtta ccttgccgaa gggcaccggt 1200 

aaaaatgttc gcgtcggtgt tttcgcccgt ggcccgaaag ctgaagaagc taaagctgct 1260 

ggtgcagaag ttgtcggcgc agaagacctg atggaagcca ttcagggcgg cagcattgat 13 2 0 

ttcgatcgtg atgcccttta tactgaaatt gccttgcgct gccataatga agcagcctcc 1380 

ggtgttttgg cagatttaag cgctgcctga ttttcgtgat cctctagagt ctatgaaatg 1440 

gagattcatt t atg cct etc tct tat teg gat aac cat cca gtc ate gat 1490 

Met Pro Leu Ser Tyr Ser Asp Asn His Pro Val lie Asp 
15 10 

age caa aaa cae gee cca cgt aaa aaa ctg ttt eta tct tgt gee tgt 1538 
Ser Gin Lys His Ala Pro Arg Lys Lys Leu Phe Leu Ser Cys Ala Cys 
15 20 25 

tta gga tta age ctt gcc tgc ctt tee agt aat gee tgg gcg agt gtt 1586 
Leu Gly Leu Ser Leu Ala Cys Leu Ser Ser Asn Ala Trp Ala Ser Val 
30 35 40 45 

gag ccg tta tec gtt age ggc aat aaa ate tac gea ggt gaa aaa gee 1634 
Glu Pro Leu Ser Val Ser Gly Asn Lys lie Tyr Ala Gly Glu Lys Ala 

50 55 60 

aaa agt ttt gcc ggc aac age tta tte tgg agt aat aat ggt tgg ggt 1682 
Lys Ser Phe Ala Gly Asn Ser Leu Phe Trp Ser Asn Asn Gly Trp Gly 

65 70 75 

999 9sa aaa ttc tac aca gcc gat ace gtt gcg teg ctg aaa aaa gac 173 0 

Gly Glu Lys Phe Tyr Thr Ala Asp Thr Val Ala Ser Leu Lys Lys Asp 
80 85 90 

tgg aaa tec age att gtt egc gcc get atg ggc gtt cag gaa age ggt 1778 
Trp Lys Ser Ser lie Val Arg Ala Ala Met Gly Val Gin Glu Ser Gly 
95 100 105 

ggt tat ctg cag gac ccg get ggc aac aag gcc aaa gtt gaa aga gtg 1826 
Gly Tyr Leu Gin Asp Pro Ala Gly Asn Lys Ala Lys Val Glu Arg Val 
110 115 120 125 

gtg gat gcc gea ate gcc aac gat atg tat gtg att att gac tgg cae 1874 
Val Asp Ala Ala lie Ala Asn Asp Met Tyr Val lie He Asp Trp His 

130 135 140 



tea cat tct gea gaa aac aat egc agt gaa gcc att cgc ttc ttc cag 
Ser His Ser Ala Glu Asn Asn Arg Ser Glu Ala He Arg Phe Phe Gin 

145 150 155 



1922 
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gaa atg gcg 
Glu Met Ala 
160 

tac aac gag 
Tyr Asn Glu 
175 

gcc gaa gcc 
Ala Glu Ala 
190 

att att gtc 
lie He Val 



cgc gat cca 
Arg Asp Pro 



gcg gga acc 
Ala Gly Thr 
240 

aat aac ggt 
Asn Asn Gly 
255 

gac ggc aat 
Asp Gly Asn 
270 

ttc atg cgt 
Phe Met Arg 



aaa age gaa 
Lys Ser Glu 



gag teg ggt 
Glu Ser Gly 
320 

gcg ggc age 
Ala Gly Ser 
335 

atg gca cca 
Met Ala Pro 
350 

get gat tgc 
Ala Asp Cys 



cgc aaa tat 
Arg Lys Tyr 



ccg ctt cag 
Pro Leu Gin 



gtg att tec 
Val He Ser 
195 

ggt acg ccc 
Gly Thr Pro 
210 

ate aac gcc 
He Asn Ala 
225 

cat ggt gag 
His Gly Glu 



att gcg ctt 
lie Ala Leu 



ggc gga gtg 
Gly Gly Val 
275 

gac aac aac 
Asp Asn Asn 
290 

999 9c^ tea 
Gly Ala Ser 
305 

aaa ata gta 
Lys He Val 



gcc gcc agt 
Ala Ala Ser 



ccg ttg acg 
Pro Leu Thr 
355 

tgc aat gcc 
Cys Asn Ala 
370 



ggc aac aag 
Gly Asn Lys 
165 

gtt tea tgg 
Val Ser Trp 
180 

gcc att cgc 
Ala He Arg 



agt tgg teg 
Ser Trp Ser 



aag aat ate 
Lys Asn He 
230 

tea tta cgc 
Ser Leu Arg 
245 

ttc gtc acc 
Phe Val Thr 
260 

aac cag aca 
Asn Gin Thr 



ate age aac 
He Ser Asn 



acc tat tat 
Thr Tyr Tyr 
310 

aaa teg ate 
Lys Ser He 
325 

aca aca acc 
Thr Thr Thr 
340 

aac cga cca 
Asn Arg Pro 



aac gtt tac 
Asn Val Tyr 



ccg aat gtc 
Pro Asn Val 



age aat acc 
Ser Asn Thr 
185 

gcc att gac 
Ala He Asp 
200 

caa aac gtt 
Gin Asn Val 
215 

gcc tat acg 
Ala Tyr Thr 



act aaa gcc 
Thr Lys Ala 



gag tgg ggc 
Glu Trp Gly 
265 

gat acc gac 
Asp Thr Asp 
280 

gca aac tgg 
Ala Asn Trp 
295 

ccg gac tct 
Pro Asp Ser 



att caa age 
He Gin Ser 



gat cag tea 
Asp Gin Ser 
345 

caa ccg aca 
Gin Pro Thr 
360 

ccc aac tgg 
Pro Asn Trp 
375 



att tat gaa 
He Tyr Glu 
170 

att aaa cct 
He Lys Pro 



ccg gat aac 
Pro Asp Asn 



gat gaa gcg 
Asp Glu Ala 
220 

ctg cat ttc 
Leu His Phe 
235 

cgc cag gcg 
Arg Gin Ala 
250 

gcc gtt aac 
Ala Val Asn 



gcc tgg gta 
Ala Trp Val 



gcg tta aat 
Ala Leu Asn 
300 

aaa aac ctg 
Lys Asn Leu 
315 

tgg cca tat 
Trp Pro Tyr 
330 

acc gat acc 
Thr Asp Thr 



cac egg caa 
His Arg Gin 



gtt age aaa 
Val Ser Lys 
380 



ate 1970 
He 



tat 2018 
Tyr 

ctg 2066 

Leu 

205 

teg 2114 
Ser 



tac 2162 
Tyr 



tta 2210 
Leu 



gcg 2258 
Ala 



acg 2306 

Thr 

285 

gat 2354 
Asp 



acc 2402 
Thr 



aaa 2450 
Lys 



acc 2498 
Thr 



acc 2546 

Thr 

365 

gac 2594 
Asp 



s 

tgg gcg ggc egg cag cga etc ata acg aag cag gcc aat cga teg tct 2642 
Trp Ala Gly Arg Gin Arg Leu lie Thr Lys Gin Ala Asn Arg Ser Ser 

385 390 395 

aca aag gga acc tgt ata ccg caa act ggt aca ctt cat ccg ttc egg 2690 
Thr Lys Gly Thr Cys lie Pro Gin Thr Gly Thr Leu His Pro Phe Arg 
400 405 410 

gca gcg att cet cct ggg eac agg ttg gta get gta act aat tga 2735 
Ala Ala lie Pro Pro Gly His Arg Leu Val Ala Val Thr Asn 
415 420 425 

ttaatctttt cacccccaaa ataaeagggc tgcgattgca gcctgatacg caacattcca 2795 

ttacttaatt gegtteaaaa gcgcecaaat ccggtgcgct gcettgtaac taatatgatt 2855 

tetctttcgt aeccgcgtta atcagctttg agttagccga cagacggaac agcgaggttg 2915 

ceggcaacgt gccgtcatta tcacgagata eggtagccag cgaggtgt cc aggctgacga 2 975 

stcggacgcg gaagccgetg tecgtatcca tgagttgact cgeatccgca ttactgaccg 3035 

ttgcagaagc agacagagac acgttgttgc ggaagtaatg tttctgtcct gaetggacgt 3095 

tgctcccgaa agcataatta atgccgtttt tatatgacgt gttatttatt acegtacgcc 3155 

gccgcgttat tgttctggtc aaaacctttg ctcacgttgc eaaaegegae geaaegggta 3215 

atgcgatgat tgccgacege tggttcctcc cagtttgaac ccgttggcat tgeeggcgaa 3275 

cgcgctnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3335 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 3 95 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 455 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 515 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 575 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 63 5 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 6 95 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 7 55 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 815 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 875 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 93 5 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 9 95 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4 055 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4115 



nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4175 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 423 5 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngatc ctctagagtc 4 2 95 

gacctgcagg aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt 4355 

tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga 4415 

ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat 4475 

gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcataggcg cgcctatggt 4535 

gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa 4595 

cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg 4655 

tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 4715 

gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 4775 

cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 4835 

tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 4895 

aatattgaaa aaggaagagt atg agt att caa cat ttc cgt gtc gcc ctt att 4948 

Met Ser lie Gin His Phe Arg Val Ala Leu lie 
430 435 

ccc ttt ttt gcg gca ttt tgc ctt cat gtt ttt get cac cca gaa acg 4996 
Pro Phe Phe Ala Ala Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr 
440 445 450 

ctg gtg aaa gta aaa gat get gaa gat cag ttg ggt gca cga gtg ggt 5044 
Leu Val Lys Val Lys Asp Ala Glu Asp Gin Leu Gly Ala Arg Val Gly 
455 460 465 470 

tac ate gaa etg gat etc aac age ggt aag ate ctt gag agt ttt cge 5092 
Tyr lie Glu Leu Asp Leu Asn Ser Gly Lys lie Leu Glu Ser Phe Arg 

475 480 485 

ecc gaa gaa cgt ttt cca atg atg age act ttt aaa gtt etg eta tgt 5140 
Pro Glu Glu Arg Phe Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys 

490 495 500 

ggc gcg gta tta tec cgt att gae gcc ggg caa gag caa etc ggt cge 5188 
Gly Ala Val Leu Ser Arg lie Asp Ala Gly Gin Glu Gin Leu Gly Arg 
505 510 515 

cge ata cac tat tet cag aat gac ttg gtt gag tac tea cca gtc aca 5236 
Arg lie His Tyr Ser Gin Asn Asp Leu Val Glu Tyr Ser Pro Val Thr 
520 525 530 

gaa aag cat ctt acg gat ggc atg aca gta aga gaa tta tgc agt get 5284 
Glu Lys His Leu Thr Asp Gly Met Thr Val Arg Glu Leu Cys Ser Ala 
535 540 545 550 
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gcc ata acc atg agt gat aac act gcg gcc aac tta ctt ctg aca acg 5332 
Ala lie Thr Met Ser Asp Asn Thr Ala Ala Asn Leu Leu Leu Thr Thr 

555 560 565 

ate gga gga ccg aag gag eta acc get ttt ttg eac aac atg ggg gat 5380 
lie Gly Gly Pro Lys Glu Leu Thr Ala Phe Leu His Asn Met Gly Asp 

570 575 580 

cat gta act cgc ctt gat cgt tgg gaa ccg gag ctg aat gaa gcc ata 5428 
His Val Thr Arg Leu Asp Arg Trp Glu Pro Glu Leu Asn Glu Ala lie 
585 590 595 

cca aac gac gag cgt gac acc acg atg cct gta gca atg gca aca acg 5476 
Pro Asn Asp Glu Arg Asp Thr Thr Met Pro Val Ala Met Ala Thr Thr 
600 605 610 

ttg cgc aaa eta tta act ggc gaa eta ctt act eta get tee egg caa 5524 
Leu Arg Lys Leu Leu Thr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gin 
615 620 625 630 

caa tta ata gac tgg atg gag gcg gat aaa gtt gca gga cca ctt ctg 5572 
Gin Leu lie Asp Trp Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu 

635 640 645 

cgc teg gcc ctt ccg get ggc tgg ttt att get gat aaa tct gga gee 5620 
Arg Ser Ala Leu Pro Ala Gly Trp Phe lie Ala Asp Lys Ser Gly Ala 

650 655 660 

ggt gag cgt ggg tct cgc ggt ate att gea gca ctg ggg cca gat ggt 5668 
Gly Glu Arg Gly Ser Arg Gly lie He Ala Ala Leu Gly Pro Asp Gly 
665 670 675 

aag ecc tec cgt ate gta gtt ate tac acg acg ggg agt cag gea act 5716 
Lys Pro Ser Arg He Val Val He Tyr Thr Thr Gly Ser Gin Ala Thr 
680 685 690 

atg gat gaa ega aat aga cag ate get gag ata ggt gcc tea ctg att 5764 
Met Asp Glu Arg Asn Arg Gin He Ala Glu He Gly Ala Ser Leu He 
695 700 705 710 

aag cat tgg taa etgtcagaee aagtttactc atatatactt tagattgatt 5816 
Lys His Trp 

taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatcteatga 5876 

ccaaaatccc ttaacgtgag ttttcgttee actgagegtc agaeeccgta gaaaagatca 5936 

aaggatcttc ttgagateet ttttttetge gegtaatctg etgettgcaa acaaaaaaac 5996 

caccgctace ageggtggtt tgtttgccgg ateaagagct accaactctt tttccgaagg 6056 

taactggett cageagagcg cagataccaa atactgtcct tctagtgtag cegtagttag 6116 

gecaecactt caagaactct gtagcaccgc ctacatacet cgetetgeta atcctgttae 6176 

cagtggctgc tgeeagtggc gataagtcgt gtettaccgg gttggactca agaegatagt 6236 
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taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 62 96 

agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 63 56 

ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 6416 

gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 6476 

acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 6536 

acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 6596 

tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 6656 

ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggcgc 6716 

gccagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 6776 

atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 6836 

tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 6896 

gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 6956 

cgccaagctt gcatgccaat tctcatgttt gacagcttat catcgataag ctttaatgcg 7016 

gtagtttatc acagttaaat tgctaacgca gtcaggcacc gtgt atg aaa tct aac 7072 

Met Lys Ser Asn 
715 

aat gcg etc ate gtc ate etc gge ace gte acc ctg gat get gta ggc 7120 
Asn Ala Leu lie Val lie Leu Gly Thr Val Thr Leu Asp Ala Val Gly 
720 725 730 

ata ggc ttg gtt atg ccg gta ctg ccg ggc etc ttg egg gat ate gte 7168 
lie Gly Leu Val Met Pro Val Leu Pro Gly Leu Leu Arg Asp lie Val 
735 740 745 

cat tee gac age ate gee agt cac tat gge gtg ctg eta gcg eta tat 7216 
His Ser Asp Ser lie Ala Ser His Tyr Gly Val Leu Leu Ala Leu Tyr 
750 755 760 765 

gcg ttg atg caa ttt eta tgc gca ecc gtt etc gga gca ctg tee gac 7264 
Ala Leu Met Gin Phe Leu Cys Ala Pro Val Leu Gly Ala Leu Ser Asp 

770 775 780 

egc ttt ggc cgc cgc cea gte ctg etc get teg eta ctt gga gee act 7312 
Arg Phe Gly Arg Arg Pro Val Leu Leu Ala Ser Leu Leu Gly Ala Thr 

785 790 795 

ate gac tac gcg ate atg gcg ace aca cec gtc ctg tgg ate etc tac 7360 
lie Asp Tyr Ala lie Met Ala Thr Thr Pro Val Leu Trp lie Leu Tyr 
800 805 810 

gee gga egc ate gtg gee ggc ate acc gge gee aca ggt gcg gtt get 7408 
Ala Gly Arg lie Val Ala Gly lie Thr Gly Ala Thr Gly Ala Val Ala 
815 820 825 
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ggc gcc tat ate gcc gac ate acc gat ggg gaa gat egg get egc eae 7456 
Gly Ala Tyr lie Ala Asp lie Thr Asp Gly Glu Asp Arg Ala Arg His 
830 835 840 845 

tte ggg etc atg age get tgt tte gge gtg ggt atg gtg gca gge cee 7504 
Phe Gly Leu Met Ser Ala Cys Phe Gly Val Gly Met Val Ala Gly Pro 

850 855 860 

gtg gee ggg gga ctg ttg gge gee ate tee ttg eat gca cea tte ett 7552 
Val Ala Gly Gly Leu Leu Gly Ala He Ser Leu His Ala Pro Phe Leu 

865 870 875 

gcg gcg gcg gtg etc aae gge etc aac eta eta etg ggc tge tte eta 7600 
Ala Ala Ala Val Leu Asn Gly Leu Asn Leu Leu Leu Gly Cys Phe Leu 
880 885 890 

atg eag gag teg eat aag gga gag cgt ega eeg atg cee ttg aga gcc 7648 
Met Gin Glu Ser His Lys Gly Glu Arg Arg Pro Met Pro Leu Arg Ala 
895 900 905 

tte aac cea gtc age tec tte egg tgg gcg egg ggc atg act ate gtc 7696 
Phe Asn Pro Val Ser Ser Phe Arg Trp Ala Arg Gly Met Thr He Val 
910 915 920 925 

gcc gca ett atg act gtc tte ttt ate atg caa etc gta gga eag gtg 7744 
Ala Ala Leu Met Thr Val Phe Phe He Met Gin Leu Val Gly Gin Val 

930 935 940 

ccg gca gcg etc tgg gtc att tte ggc gag gac egc ttt egc tgg age 7792 
Pro Ala Ala Leu Trp Val He Phe Gly Glu Asp Arg Phe Arg Trp Ser 

945 950 955 

gcg acg atg ate ggc ctg teg ett gcg gta tte gga ate ttg eae gcc 7840 
Ala Thr Met He Gly Leu Ser Leu Ala Val Phe Gly He Leu His Ala 
960 965 970 

etc get caa gee tte gtc act ggt cee gee acc aaa cgt tte ggc gag 7888 
Leu Ala Gin Ala Phe Val Thr Gly Pro Ala Thr Lys Arg Phe Gly Glu 
975 980 985 

aag eag gcc att ate gcc ggc atg gcg gcc gac gcg ctg ggc tac gtc 7936 
Lys Gin Ala He He Ala Gly Met Ala Ala Asp Ala Leu Gly Tyr Val 
990 995 1000 1005 

ttg etg gcg tte gcg acg ega ggc tgg atg gcc tte cee att atg 7981 
Leu Leu Ala Phe Ala Thr Arg Gly Trp Met Ala Phe Pro He Met 

1010 1015 1020 

att ett etc get tee ggc ggc ate ggg atg ecc gcg ttg eag gcc 8026 
He Leu Leu Ala Ser Gly Gly He Gly Met Pro Ala Leu Gin Ala 

1025 1030 1035 

atg ctg tee agg eag gta gat gac gac cat eag gga eag ett caa 8071 
Met Leu Ser Arg Gin Val Asp Asp Asp His Gin Gly Gin Leu Gin 

1040 1045 1050 
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gga teg etc gcg get 


ctt 3LCC aac 


eta act teg ate act 


Qcia ecQ 


8116 


Gly Ser Leu Ala Ala 


Leu Thi" Ser 


Leu Thr Ser He Thr 


Gly Pro 






1055 




1060 




1065 




ctg ate gtc acg gcg 


att tat cicc 


gee teg gcg age aca 


tcrcf aac 


8161 


Leu lie Val Thr Ala 


He Tvr Ala 


Ala Ser Ala Ser Thr 


Tro Asn 






1070 




1075 




1080 




ggg ttg gca tgg att 


gta ggc gee 


gee eta tac ctt gte 


tge etc 


8206 


Gly Leu Ala Trp lie 


Val Gly Ala 


Ala Leu Tyr Leu Val 


Cys Leu 






1085 




1090 




1095 




cce geg ttg cgt cgc 


ggt gca tgg 


age egg gee ace teg 


ace tga 


8251 


Pro Ala Leu Arg Arg 


Gly Ala Trp 


Ser Arg Ala Thr Ser 


Thr 






1100 




1105 








atggaagccg 


gcggcacete 


getaaeggat 


tcaccactcc 


aagaattgga 


geeaateaat 


8311 


tcttgcggag 


aactgtgaat 


gcgcaaacca 


acccttggca 


gaacatatcc 


atcgcgtccg 


8371 


ccatctccag 


cagccgcacg 


cggegeatet 


cggggtegae 


tetagaggat 


cecegcaacg 


8431 


ctgteagcgc 


tttccagtta 


aaeggeteca 


acgtcgccat 


aggtaattee 


tcgeccggcc 


8491 


atacgatcgg 


gcaggtgccg 


ttggctateg 


ecgtcgcctg 


actcatcaea 


etatetteeg 


8551 


ctgcatcgcg 


aagggttttg 


aecaettett 


ccatctctcc 


gtgegeegga 


tgccatgctc 


8611 


acgtacgcgg 


cttatcagat 


agtegggeag 


gccgtcgttc 


cagcccaatg 


aggggaaget 


8671 


ggcgtggagc 


gatgccagca 


eetgctcctc 


aaeaeegtaa 


tggccggegg 


egaacaggca 


8731 


ttcggcggta 


agcgcttcca 


geeetttaat 


catcacgctg 


cggcacatct 


tgatagccga 


8791 


cacgetgcca 


acgtggttac 


eaceatagcg 


ggegttacat 


ccaagegtgg 


tgagtaatte 


8851 


agcaattgcc 


tctgcetgtg 


gtcccccegt 


eaacagcggc 


gttcggagtg 


cccctggggg 


8911 


gaccggegcc 


atcaccgcta 


catcgacata 


agcgccgggc 


ttaaagcatt 


tggcagcctg 


8971 


acgcttggte 


tgcggggcga 


cggagttaag 


gteaagaaaa 


taetgegtgt 


cggtcatcag 


9031 


cggtgcagct 


tgtgaggcga 


catccagggc 


ggatcccgcg 


gtgacggtgg 


aaaatatgag 


9091 


ttcggcacct 


gtcaacgcgt 


cagecaggga 


gattgecgce 


egcacgcctc 


eeegatgcgc 


9151 


cttegttatc 


ategcatcgc 


get caggace 


ttgcagcttg 


caatcccaga 


cggtgactgg 


9211 


gttcactttt 


gccagtgcat 


/"* Si Si i~ 

t-.c-y(_a.a.ya.a.L- 


gceacetgct 


tcaccataae 


cu ataaacgt 


9 2 71 


tattgtcgtc 


ataacagctc 


cttacgcgge 


eacacgtegg 


ccggaatgea 


aacgtcgccc 


9331 


gcgaaeagaa 


gtcgcgccgt 


acgcagcaga 


ecgcagcctg 


ccaaetgcec 


attatcatea 


9391 


agccggagcg 


ccacgctgaa 


ttgggtaccg 


ageteegaat 


tgggtaccga 


getcgaatta 


9451 


attcgagctc 


ggtaeecggg 


gatcnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


9511 



nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 



nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
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nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
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nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 



nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 



nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 



9571 
9631 
9691 
9751 
9811 
9871 
9931 
9991 
10051 
10111 
10171 
10231 
10291 
10351 
10411 
10471 
10531 
10591 
10651 
10711 
10771 
10831 
10891 
10951 
11011 
11071 
11131 
11191 
11251 
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nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11311 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 113 71 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 114 31 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 114 91 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnn 11544 



<210> 13 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic primer 
<400> 13 

ctgttccgtt accaacac 18 



<210> 14 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic primer 
<400> 14 

gtgaatggga tcacgagt 18 



<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic primer 
<400> 15 

accatcagca tcaacgccca acaacg 26 

<210> 16 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic primer 
<400> 16 

gactggatgg ttatccgaat aagagagagg 3 0 
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<210> 17 

<211> 11772 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic nucleotide 
construct 



<220> 

<221> modif ied_base 
<222> (144) . . (1143) 

<223> a, c, t, unknown, or other 



<220> 

<221> modif ied_base 
<222> (5753) . . (7567) 

<223> a, c, t, unknown, or other 

<400> 17 

atcgatgaat tgatccgaag ttcctattct ctagaaagta taggaacttc gaattgtcga 60 

caagcttgat ctggcttatc gaaattaata cgactcacta tagggagacc ggaattcccc 120 

tgcaggtcga ctctagagga tcannnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 24 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 00 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 36 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 42 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4 80 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 54 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 66 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 72 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 78 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 84 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9 00 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 96 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 102 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 108 0 



r 
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nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 114 0 

nnnagcgcgt tcgccggcaa tgccaacggg ttcaaactgg gaggaaccag cggtcggcaa 12 00 

tcatcgcatt acccgttgcg tcgcgtttgg caacgtgagc aaaggttttg accagaacaa 1260 

taacgcggcg gcgtacggta ataaataaca cgtcatataa aaacggcatt aattatgctt 1320 

tcgggagcaa cgtccagtca ggacagaaac attacttccg caacaacgtg tctctgtctg 1380 

ctbctgcaac ggtcagtaat gcggatgcga gtcaactcat ggatacggac agcggcttcc 144 0 

gcgtccgatt cgtcagcctg gacacctcgc tggctaccgt atctcgtgat aatgacggca 1500 

cgttgccggc aacctcgctg ttccgtctgt cggctaactc aaagctgatt aacgcgggta 1560 

cgaaagagaa atcatattag ttacaaggca gcgcaccgga tttgggcgct tttgaacgca 1620 

attaagtaat ggaatgttgc gtatcaggct gcaatcgcag ccctgttatt ttgggggtga 1680 

aaagattaat caattagtta cagctaccaa cctgtgccca ggaggaatcg ctgcccggaa 1740 

cggatgaagt gtaccagttt gcggtataca ggttcccttt gtagacgatc gattggcctg 1800 

cttcgttatg agtcgctgcc ggcccgccca gtctttgcta acccagttgg ggtaaacgtt 1860 

ggcattgcag caatcagcgg tttgccggtg tgtcggttgt ggtcggttcg tcaacggtgg 1920 

tgccatggtg gtatcggttg actgatcggt tgttgtactg gcggcgctgc ccgctttata 1980 

tggccagctt tgaatgatcg attttactat tttacccgac tcggtcaggt ttttagagtc 2040 

cggataatag gttgatgccc cttcgctttt atcatttaac gcccagtttg cgttgctgat 2100 

gttgttgtca cgcatgaacg ttacccaggc gtcggtatct gtctggttca ctccgccatt 2160 

gccgtccgcg ttaacggcgc cccactcggt gacgaaaagc gcaataccgt tatttaacgc 222 0 

ctggcgggct ttagtgcgta atgactcacc atgggttccc gcgtagaaat gcagcgtata 2280 

ggcgatattc ttggcgttga ttggatcgcg cgacgcttca tcaacgtttt gcgaccaact 2340 

999cgtaccg acaataatca ggttatccgg gtcaatggcg cgaatggcgg aaatcacggc 2400 

ttcggcataa ggtttaatgg tattgctcca tgaaacctga agcggctcgt tgtagatttc 2460 

ataaatgaca ttcggcttgt tgccatattt gcgcgccatt tcctggaaga agcgaatggc 2520 

ttcactgcga ttgttttctg cagaatgtga gtgccagtca ataatcacat acatatcgtt 2580 

ggcgattgcg gcatccacca ctctttcaac tttggccttg ttgccagccg ggtcctgcag 2640 

ataaccaccg ctttcctgaa cgcccatagc ggcgcgaaca atgctggatt tccagtcttt 2700 

tttcagcgac gcaacggtat cggctgtgta gaatttttcc ccaccccaac cattattact 2760 

ccagaataag ctgttgccgg caaaactttt ggctttttca cctgcgtaga ttttattgcc 2820 
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gctaacggat 


aacggctcaa 


cactcgccca 


ggcattactg 


gaaaggcagg 


caaggcttaa 


2880 


tcctaaacag 


gcacaagata 


gaaacagttt 


tttacgtggg 


gcgtgttttt 


ggctatcgat 


2940 


gactggatgg 


ttatccgaat 


aagagagagg 


cataaatgaa 


tctccatttc 


atagactcta 


3000 


gaggatcacg 


aaaatcaggc 


agcgcttaaa 


tctgccaaaa 


caccggaggc 


tgcttcatta 


3060 


tggcagcgca 


aggcaatttc 


agtataaagg 


gcatcacgat 


cgaaatcaat 


gctgccgccc 


3120 


tgaatggctt 


ccatcaggtc 


ttctgcgccg 


acaacttctg 


caccagcagc 


tttagcttct 


3180 


tcagctttcg 


ggccacgggc 


gaaaacaccg 


acgcgaacat 


ttttaccggt 


gcccttcggc 


3240 


aaggtaacaa 


caccacgaac 


catctgatca 


aggtattaag 


gccgccggac 


aggtaaaatg 


3300 


gttatggcta 


accgatgccg 


atattgcgca 


tcaggatgat 


accttgaaaa 


tgttggttgc 


3360 


tcatgccgaa 


aaagacggat 


tgatttttca 


ttccctgatg 


gcgcgactaa 


gttgtaacaa 


3420 


ttttgcagaa 


tgggcgctta 


ttccggcctt 


cgtctttttc 


ttccagatgt 


tgtatcct tt 


3480 


tcggcatatc 


aataatccca 


agcataaaac 


ggccggtgct 


gccggtggat 


gcatgttagc 


3540 


cgatcatcat 


ctgtgctggt 


ttggataaac 


cgttactaat 


ctggcgcgcg 


acccgataga 


3600 


cagcggcttc 


cccagccccc 


agtgagctac 


caatagcaag 


99tgccaccc 


tgctggaaaa 


3660 


tggagtctag 


gatttcatta 


aaactgacgc 


ttaaagtgaa 


tttccacatc 


ccttcaacgg 


3720 


gcgtttttct 


ctgaaatatt 


tttcttatcg 


gaaatgctcg 


ctccgtatat 


tggtgaaaga 


3780 


gataaatgcc 


agcgtaacta 


caggtgacaa 


aaagcgtgaa 


tfegcgtcagg 


caccatataa 


3840 


acaaaaaata 


ccccaaaggc 


atatgaagcc 


aataaccaat 


gccacaacct 


ccggttctga 


3900 


cgcaggtcgt 


aataaactca 


taaatagtga 


ccagtttaaa 


gcggttacac 


agccgcaaca 


3960 


tcccggttgg 


accagccgat 


at tcataaaa 


agtataatca 


gcatacaaag 


caaggcatct 


4020 


ggcttgacct 


cggcaggcca 


tcccaatctt 


gaagtgccta 


aaatcaagat 


accgcctaac 


4080 


cctaccaaca 


tacctatcgc 


cgcactaaaa 


aaatcggctc 


tgatgcaaaa 


ggcaaggaca 


4140 


tcatcaaatt 


ggttaaaatc 


tttttcctga 


aaagcttttg 


aaccgtaatg 


cagcaaggtt 


4200 


tgccatgact 


gaaaacgggt 


aatatccgaa 


atcagggtag 


caaaagtcgt 


aatcaaaagc 


4260 


atgacaccaa 


acaggttcaa 


tcccaatgtc 


tgggatgtcc 


atgcgacata 


gacaaaactg 


4320 


cataccgcat 


tcaaaacacg 


gcctgtaatc 


agaatgccgg 


tattgccaat 


gatacgggaa 


4380 


aagacagatc 


gaatttgatg 


ccgtggataa 


attgccggtt 


gatcgatccc 


cgggtaccga 


4440 


gctcgaattc 


cgagcttggc 


gcgcctatgc 


ggtgtgaaat 


accgcacaga 


tgcgtaagga 


4500 


gaaaataccg 


catcaggcgc 


cattcgccat 


tcaggctgcg 


caactgttgg 


gaagggcgat 


4560 



cggtgcgggc 

taagttgggt 

tccatcaacg 

cacccaccct 

tcggcaaaga 

ttcgcgatcg 

ttcgccaggt 

tgttatcgct 

ttgaagaatt 

tggcggcctg 

ttcgctacga 

cggattactg 

aagtgcgct t 

aagagattat 

acacgctgcg 

agggcgacta 

tctatccgga 

tcgtttcggc 

ccaacctggc 

cggagctgat 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 



ctcttcgcta 
aacgccaggg 
cttgctgtaa 
gagcgttgag 
cccggctatc 
tatggttgag 
ttactacctg 
cggcatttat 
aattgatgaa 
cttcctcgat 
ctacgggatg 
gctggaatac 
cggcggacgc 
tgccgaagcc 
cctgtggagc 
cttcgcggcg 
tgactcgacc 
gacggtgcag 
ggacaaaatc 
gcgcctgctg 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 



ttacgccagc 
tt ttcccagt 
ccaggagcca 
gcgttaaagc 
gctaacaagc 
cgctggctgc 
tcgatggaat 
gaggatgtga 
gaaaacgacc 
tcgcttgcgg 
tt taagcaga 
ggtaacccgt 
attcagcagg 
tatgaccaga 
gcccaggcca 
gtggaagata 
tattcaggac 
gacatcctca 
gcgattcatc 
attgacgagc 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
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tggcgaaagg 
cacgacgttg 
aagctatgaa 
actctattgc 
atgaatggct 
gctcaaaccg 
ttttgattgg 
acagcgcgct 
cgggcttagg 
cgctggggtt 
atatcgtcga 
gggagttcga 
aaggtaaata 
ttatccctgg 
gcagcgagat 
aaaaccattc 
gcgagctgcg 
gccgccacta 
tcaacgacac 
ataagatcag 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 



99gatgtgct 

taaaacgacg 

tgtacctttt 

ttataagctg 

caacgccacg 

cgcgcacgtc 

ccgtacgttg 

ggaagagatg 

caacggcggt 

accgggccgc 

tgggcggcag 

gcgccataat 

ctcccgctgg 

cttcgacacc 

taacctcggt 

cgagaacgtg 

cctgcggcag 

ccagctgcat 

gaacccggtg 

ctgggatgag 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 



gcaaggcgat 
gccagtgaat 
agctactcgt 
atgtttatca 
ctgttcgccg 
tctcaggaag 
tccaacgcgc 
gggctgaacc 
cttggtcgtc 
ggctacggta 
aaagaatccc 
acgcgctaca 
gtggagaccg 
gacgccacca 
aaattcaacc 
tcgcgggtac 
gagtacttcc 
aaaacctacg 
ctgtcgattc 
ggnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 



4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 



nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnngaa 

ggactgaaag 

agtgtcattc 

caattttgat 

ttcttttctg 

tattctgacg 

tgctgttttg 

ctatggtaaa 



nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

ttcgagctcg 

gtttcgacgg 

atgacgatcc 

caataaagcg 

attacctgtc 

catctgacaa 

gaaccgttgg 

gacggcgaag 



nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

gtacccgggg 

cggttgtgtg 

agttcgatat 

tttgacctga 

ctgttaacct 

ttggttccaa 

gtatcaaact 

aaggacgcat 
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nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

atcgacataa 

ggttttgctt 

tcaacagacc 

tgcatgaggg 

gtggatatag 

tgacctgaag 

tattcgcgag 

catcattgta 



nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

ccgataggtc 

gcccatctgg 

gtctttgtaa 

taaatccatt 

aaggtcggtt 

aaggcgcgca 

gtcgaaggac 

aagcctatta 



nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

ctgcattgat 

cggcatgaat 

tcggcaccga 

cgttcggttg 

caatgagtag 

tcttttatga 

agcgttttgc 

^tggtgaagc 



6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
' 8040 
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cgctaccgct 


ggaaatggta 


tcactatcgg 


tttggcagcg 


ccttctgatg 


aagctgtcga 


8100 


tgctttttat 


aaagcaggct 


tggctaatgg 


cggtaaggat 


gccggagaac 


cggggcctcg 


8160 


tccggctgct 


aataat tctc 


ggggtgccta 


tttatatgac 


cctgaaggca 


ataaaatctg 


8220 


cgctttcaat 


tttaaataag 


atttctttgg 


tgcagggtta 


ttcaaaatag 


ccctgcattt 


8280 


tcagtattat 


agcggccatt 


atggcttttg 


ccttgataaa 


aaatttatca 


gggctgtttt 


8340 


tcgtgatgaa 


tatttttgat 


ttttcaagaa 


aagcctgata 


tcttccaaca 


tctttctttg 


8400 


tatataaatg 


gagcgagcta 


tggcgcgcgt 


aactgtcgaa 


gactgtatcg 


ataaagttca 


8460 


taatcgtttc 


gatttgatcc 


tctagagtca 


acctgcttgt 


tactcgtgat 


cccattcaca 


8520 


agggcgaatt 


aattcgccct 


tctgttccgt 


taccaacact 


tgagccggag 


gcataatggg 


8580 


aaagccaatg 


tggcgttgtt 


gggcgttgat 


gctgatggtg 


tggttcagtg 


cgtcggctac 


8640 


ggcggcgaac 


ggctgggaaa 


tctataaaag 


ccgtttcatg 


accacggacg 


ggcgcattca 


8700 


ggataccggc 


aataagaatg 


tcagccacac 


cgaaggtcag 


ggat tcgcca 


tgctgatggc 


8760 


ggtgcattac 


gatgaccgca 


tcgcgttcga 


taacctgtgg 


aactggacgc 


aaagccacct 


8820 


gcggaacacg 


accagcggct 


tgttctactg 


gcgttacgat 


ccgtcggcgg 


ccaatccggt 


8880 


ggtggataag 


aacaacgcct 


cggatggcga 


tgtgctgat t 


gcctgggcgt 


tgttaaaagc 


8940 


gggaaataag 


tggcaggaca 


accgttacct 


gcaggcgtcg 


gacagcatcc 


agaaagcgat 


9000 


catcgccagc 


aatatcattc 


agtttgcggg 


ccgcaccgtg 


atgttgcccg 


gcgcctatgg 


9060 


t ttcaacaag 


aacagctatg 


tgatccttaa 


cccgtcgtat 


ttcctgttcc 


cggcctggcg 


9120 


cgact ttgct 


aaccgcagcc 


atcttcaggt 


gtggcggcaa 


ctgattgacg 


acagcctgtc 


9180 


attggtcgga 


gaaatgcgtt 


tcggtcaggt 


cgggctgccg 


acggactggg 


cggcgctgaa 


9240 


cgcggatggc 


tcgatggcgc 


cggcgacggc 


ctggccgtcg 


cgtttcagtt 


acgacgccat 


9300 


tcgtatcccg 


ctgtatttgt 


actggtatga 


cgccaaaacc 


acggcgctgg 


tgccgt tcca 


9360 


gctgtactgg 


cgtaactatc 


cccgcctgac 


gacgccggcc 


tgggttgatg 


tgctgagcag 


9420 


taacaccgcg 


acttacaata 


tgcagggcgg 


tttgctggcg 


gtgcgcgacc 


tgacgatggg 


9480 


caacctcgac 


gggctcagcg 


atctgccagg 


cgcatcggaa 


gattactact 


cgtcgagcct 


9540 


gcgcctgctg 


gtgatgttgg 


cgcgcggtaa 


ataaccttat 


tcttgcggta 


cacatggcga 


9600 


ggacgatgtc 


cttgccattt 


tccccacttt 


tatccctctg 


aatggcgtgt 


tt ttcacgct 


9660 


ttgttaacct 


gcttgttact 


cgtgatccca 


ttcacaaggg 


cgaattgacc 


tgcaggcatg 


9720 


caagcttggc 


gtaatcatgg 


tcatagctgt 


ttcctgtgtg 


aaattgttat 


ccgctcacaa 


9780 
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ttccacacaa 


catacgagcc 


ggaagcataa 


agtgtaaagc 


ctggggtgcc 


taatgagtga 


9840 


gctaactcac 


attaattgcg 


ttgcgctcac 


tgcccgcttt 


ccagtcggga 


aacctgtcgt 


9900 


gccagctgca 


ttaatgaatc 


ggccaacgcg 


cggggagagg 


cggtttgcgt 


attgggcgct 


9960 


cttccgctgg 


cgcgccaggt 


cgactctaga 


ggatccccgg 


ggaagatctt 


ccggaagatc 


10020 


ttcccgagct 


cgaattaatt 


ccgcgatgaa 


ttgatcccgg 


aagttcctat 


tctctagaaa 


10080 


gtataggaac 


tcgaattggt 


cgacaagcta 


gcttgcatgc 


aagcttgtat 


tctatagtgt 


10140 


cacctaaatc 


gtatgtgtat 


gatacataag 


gttatgtatt 


aattgtagcc 


gcgttctaac 


10200 


gacaatatgt 


acaagcctaa 


ttgtgtagca 


tctggcttac 


tgaagcagac 


cctatcatct 


10260 


ctctcgtaaa 


ctgccgtcag 


agtcggtttg 


gttggacgaa 


ccttctgagt 


ttctggtaac 


10320 


gccgttccgc 


accccggaaa 


tggtcagcga 


accaatcagc 


agggtcatcg 


ctagcccatg 


10380 


gctaattctg 


tcagccgtta 


agtgttcctg 


tgtcactgaa 


aattgctttg 


agaggctcta 


10440 


agggcttctc 


agtgcgttac 


atccctggct 


tgttgtccac 


aaccgttaaa 


ccttaaaagc 


10500 


tttaaaagcc 


ttatatattc 


ttttttttct 


tataaaactt 


aaaaccttag 


aggctattta 


10560 


agttgctgat 


ttatattaat 


tttattgttc 


aaacatgaga 


gcttagtacg 


tgaaacatga 


10620 


gagcttagta 


cgttagccat 


gagagcttag 


tacgttagcc 


atgagggttt 


agttcgt taa 


10680 


acatgagagc 


ttagtacgtt 


aaacatgaga 


gcttagtacg 


tgaaacatga 


gagcttagta 


10740 


cgtactatca 


acaggttgaa 


ctgcggatct 


tgcggccgca 


aaaattaaaa 


atgaagtttt 


10800 


gacggtatcg 


aaccccagag 


tcccgctcag 


aagaactcgt 


caagaaggcg 


atagaaggcg 


10860 


atgcgctgcg 


aatcgggagc 


ggcgataccg 


taaagcacga 


ggaagcggtc 


agcccattcg 


10920 


ccgccaagct 


cttcagcaat 


atcacgggta 


gccaacgcta 


tgtcctgata 


gc99tccgcc 


10980 


acacccagcc 


ggccacagtc 


gatgaatcca 


gaaaagcggc 


cattttccac 


catgatattc 


11040 


ggcaagcagg 


catcgccatg 


ggtcacgacg 


agatcctcgc 


cgtcgggcat 


ccgcgccttg 


11100 


agcctggcga 


acagttcggc 


tggcgcgagc 


ccctgatgct 


cttcgtccag 


atcatcctga 


11160 


tcgacaagac 


cggcttccat 


ccgagtacgt 


gctcgctcga 


tgcgatgttt 


cgcttggtgg 


11220 


tcgaatgggc 


aggtagccgg 


atcaagcgta 


tgcagccgcc 


gcattgcatc 


agccatgatg 


11280 


gatactttct 


cggcaggagc 


aaggtgagat 


gacaggagat 


cctgccccgg 


cacttcgccc 


11340 


aatagcagcc 


agtccct tec 


cgcttcagtg 


acaacgtcga 


gcacagctgc 


gcaaggaacg 


11400 


cccgtcgtgg 


ccagccacga 


tagccgcgct 


gcctcgtctt 


ggagttcatt 


cagggcaccg 


11460 


gacaggtcgg 


tcttgacaaa 


aagaaccggg 


cgcccctgcg 


ctgacagccg 


gaacacggcg 


11520 
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gcatcagagc agccgattgt ctgttgtgcc cagtcatagc cgaatagcct ctccacccaa 11580 

gcggccggag aacctgcgtg caatccatct tgttcaatca tgcgaaacga tcctcatcct 11640 

gtctcttgat ccactagatt attgaagcat ttatcagggt tattgtctca tgagcggata 11700 

catatttgaa tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa 11760 

agtgccacct gc 11772 



<210> 18 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 18 

atatttttga tttttcaaga aaagcctgat atcttccaac atctt 45 

<210> 19 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 19 

gatttgatcc tctagagtca acctgcttgt tactcgtgat cccat 45 

<210> 20 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 20 

gagtcaacct gcttgttact cgtgatccca ttcacaaggg cgaa 44 



<210> 21 
<211> 42 
<212> DNA 

<213> Artificial Sequence 



24 



<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 21 

ttactcgtga tcccattcac aagggcgaat taattcgccc tt 



<210> 22 
<211> 427 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
amino acid sequence 

<400> 22 

Met Pro Leu Ser Tyr Ser Asp Asn His Pro Val lie Asp Ser Gin Lys 
15 10 15 



His Ala Pro Arg Lys Lys Leu Phe Leu Ser Cys Ala Cys Leu Gly Leu 

20 25 30 



Ser Leu Ala Cys Leu Ser Ser Asn Ala Trp Ala Ser Val Glu Pro Leu 
35 40 45 



Ser Val Ser Gly Asn Lys lie Tyr Ala Gly Glu Lys Ala Lys Ser Phe 
50 55 60 



Ala Gly Asn Ser Leu Phe Trp Ser Asn Asn Gly Trp Gly Gly Glu Lys 
65 70 75 80 



Phe Tyr Thr Ala Asp Thr Val Ala Ser Leu Lys Lys Asp Trp Lys Ser 

85 90 95 



Ser lie Val Arg Ala Ala Met Gly Val Gin Glu Ser Gly Gly Tyr Leu 

100 105 110 



Gin Asp Pro Ala Gly Asn Lys Ala Lys Val Glu Arg Val Val Asp Ala 
115 120 125 



Ala lie Ala Asn Asp Met Tyr Val lie lie Asp Trp His Ser His Ser 
130 135 140 



Ala Glu Asn Asn Arg Ser Glu Ala lie Arg Phe Phe Gin Glu Met Ala 
145 150 155 160 



Arg Lys Tyr Gly 



Pro Leu Gin Val 

180 



Val lie Ser Ala 
195 



Gly Thr Pro Ser 
210 



lie Asn Ala Lys 
225 



His Gly Glu Ser 



lie Ala Leu Phe 

260 



Gly Gly Val Asn 
275 



Asp Asn Asn lie 
290 



Gly Ala Ser Thr 
305 



Lys lie Val Lys 



Ala Ala Ser Thr 

340 



Pro Leu Thr Asn 
355 



Asn Lys Pro Asn 
165 



Ser Trp Ser Asn 



lie Arg Ala lie 

200 



Trp Ser Gin Asn 
215 



Asn lie Ala Tyr 
230 



Leu Arg Thr Lys 
245 



Val Thr Glu Trp 



Gin Thr Asp Thr 

280 



Ser Asn Ala Asn 
295 



Tyr Tyr Pro Asp 
310 



Ser lie lie Gin 
325 



Thr Thr Asp Gin 



Arg Pro Gin Pro 

360 
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Val lie Tyr Glu 
170 



Thr lie Lys Pro 
185 



Asp Pro Asp Asn 



Val Asp Glu Ala 

220 



Thr Leu His Phe 
235 



Ala Arg Gin Ala 
250 



Gly Ala Val Asn 
265 



Asp Ala Trp Val 



Trp Ala Leu Asn 

300 



Ser Lys Asn Leu 
315 



Ser Trp Pro Tyr 
330 



Ser Thr Asp Thr 
345 



Thr His Arg Gin 



lie Tyr Asn Glu 
175 



Tyr Ala Glu Ala 
190 



Leu lie lie Val 
205 



Ser Arg Asp Pro 



Tyr Ala Gly Thr 

240 



Leu Asn Asn Gly 
255 



Ala Asp Gly Asn 
270 



Thr Phe Met Arg 
285 



Asp Lys Ser Glu 



Thr Glu Ser Gly 

320 



Lys Ala Gly Ser 
335 



Thr Met Ala Pro 
350 



Thr Ala Asp Cys 
365 



Cys Asn Ala Asn Val Tyr Pro Asn Trp Val Ser Lys Asp Trp Ala Gly 
370 375 380 
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Arg Gin Arg Leu lie Thr Lys Gin Ala Asn Arg Ser Ser Thr Lys Gly 
385 390 395 400 



Thr Cys lie Pro Gin Thr Gly Thr Leu His Pro Phe Arg Ala Ala lie 

405 410 415 



Pro Pro Gly His Arg Leu Val Ala Val Thr Asn 

420 425 



<210> 23 
<211> 286 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
amino acid sequence 

<400> 23 

Met Ser lie Gin His Phe Arg Val Ala Leu lie Pro Phe Phe Ala Ala 
15 10 15 



Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr Leu Val Lys Val Lys 

20 25 30 



Asp Ala Glu Asp Gin Leu Gly Ala Arg Val Gly Tyr lie Glu Leu Asp 
35 40 45 



Leu Asn Ser Gly Lys lie Leu Glu Ser Phe Arg Pro Glu Glu Arg Phe 
50 55 60 



Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys Gly Ala Val Leu Ser 
65 70 75 80 



Arg lie Asp Ala Gly Gin Glu Gin Leu Gly Arg Arg lie His Tyr Ser 

85 90 95 



Gin Asn Asp Leu Val Glu Tyr Ser Pro Val Thr Glu Lys His Leu Thr 

100 105 110 



Asp Gly Met Thr Val Arg Glu Leu Cys Ser Ala Ala lie Thr Met Ser 
115 120 125 



Asp Asn Thr Ala Ala Asn Leu Leu Leu Thr Thr lie Gly Gly Pro Lys 
130 135 140 
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Glu Leu Thr Ala Phe Leu His Asn Met Gly Asp His Val Thr Arg Leu 
145 150 155 160 



Asp Arg Trp Glu Pro Glu Leu Asn Glu Ala lie Pro Asn Asp Glu Arg 

165 170 175 



Asp Thr Thr Met Pro Val Ala Met Ala Thr Thr Leu Arg Lys Leu Leu 

180 185 190 



Thr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gin Gin Leu lie Asp Trp 
195 200 205 



Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu Arg Ser Ala Leu Pro 
210 215 220 



Ala Gly Trp Phe lie Ala Asp Lys Ser Gly Ala Gly Glu Arg Gly Ser 
225 230 235 240 



Arg Gly lie lie Ala Ala Leu Gly Pro Asp Gly Lys Pro Ser Arg lie 

245 250 255 



Val Val lie Tyr Thr Thr Gly Ser Gin Ala Thr Met Asp Glu Arg Asn 

260 265 270 



Arg Gin lie Ala Glu lie Gly Ala Ser Leu lie Lys His Trp 
275 280 285 



<210> 24 
<211> 396 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
amino acid sequence 

<400> 24 

Met Lys Ser Asn Asn Ala Leu lie Val He Leu Gly Thr Val Thr Leu 
15 10 15 



Asp Ala Val Gly He Gly Leu Val Met Pro Val Leu Pro Gly Leu Leu 

20 25 30 



28 



Arg Asp lie Val His Ser Asp Ser lie Ala Ser His Tyr Gly Val Leu 
35 40 45 



Leu Ala Leu Tyr Ala Leu Met Gin Phe Leu Cys Ala Pro Val Leu Gly 
50 55 60 



Ala Leu Ser Asp Arg Phe Gly Arg Arg Pro Val Leu Leu Ala Ser Leu 
65 70 75 80 



Leu Gly Ala Thr lie Asp Tyr Ala lie Met Ala Thr Thr Pro Val Leu 

85 90 95 



Trp He Leu Tyr Ala Gly Arg He Val Ala Gly He Thr Gly Ala Thr 

100 105 110 



Gly Ala Val Ala Gly Ala Tyr He Ala Asp He Thr Asp Gly Glu Asp 
115 120 125 



Arg Ala Arg His Phe Gly Leu Met Ser Ala Cys Phe Gly Val Gly Met 
130 135 140 



Val Ala Gly Pro Val Ala Gly Gly Leu Leu Gly Ala He Ser Leu His 
145 150 155 160 



Ala Pro Phe Leu Ala Ala Ala Val Leu Asn Gly Leu Asn Leu Leu Leu 

165 170 175 



Gly Cys Phe Leu Met Gin Glu Ser His Lys Gly Glu Arg Arg Pro Met 

180 185 190 



Pro Leu Arg Ala Phe Asn Pro Val Ser Ser Phe Arg Trp Ala Arg Gly 
195 200 205 



Met Thr He Val Ala Ala Leu Met Thr Val Phe Phe He Met Gin Leu 
210 215 220 



Val Gly Gin Val Pro Ala Ala Leu Trp Val He Phe Gly Glu Asp Arg 
225 230 235 240 



Phe Arg Trp Ser Ala Thr Met He Gly Leu Ser Leu Ala Val Phe Gly 

245 250 255 



He Leu His Ala Leu Ala Gin Ala Phe Val Thr Gly Pro Ala Thr Lys 

260 265 270 



29 



Arg Phe Gly Glu Lys Gin Ala lie lie Ala Gly Met Ala Ala Asp Ala 
275 280 285 



Leu Gly Tyr Val Leu Leu Ala Phe Ala Thr Arg Gly Trp Met Ala Phe 
290 295 300 



Pro lie Met lie Leu Leu Ala Ser Gly Gly lie Gly Met Pro Ala Leu 
305 310 315 320 



Gin Ala Met Leu Ser Arg Gin Val Asp Asp Asp His Gin Gly Gin Leu 

325 330 335 



Gin Gly Ser Leu Ala Ala Leu Thr Ser Leu Thr Ser lie Thr Gly Pro 

340 345 350 



Leu He Val Thr Ala He Tyr Ala Ala Ser Ala Ser Thr Trp Asn Gly 
355 360 365 



Leu Ala Trp He Val Gly Ala Ala Leu Tyr Leu Val Cys Leu Pro Ala 
370 375 380 



Leu Arg Arg Gly Ala Trp Ser Arg Ala Thr Ser Thr 
385 390 395 



